High Performance Parallel LOBPCG Method for Large Hamiltonian Derived from Hubbard Model on Multi-GPU Systems

نویسندگان

چکیده

Abstract The physical property of the Hubbard model can be understood by solving eigenvalue problem for Hamiltonian derived from model. Since is a large sparse matrix, an iteration method usually utilized problems. One effectual solvers this LOBPCG (Locally Optimal Block Preconditioned Conjugate Gradient) method. tuning strategies on GPU systems when all vectors are stored in device memory have been proposed. In research, we propose parallel multi-GPU system and some host memory. When used multi eigenpairs (eigenvalues corresponding eigenvectors), number vectors, whose size same as dimension Hamiltonian, proportional to eigenpairs. On other hand, consumption non-zero elements significantly reduced considering regular arrangement elements. Therefore, execute GPUs, transferred between needed. cost data transfer very large, also optimization it. simulation result shows that effective high performance computing.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

High performance MRI simulations of motion on multi-GPU systems

BACKGROUND MRI physics simulators have been developed in the past for optimizing imaging protocols and for training purposes. However, these simulators have only addressed motion within a limited scope. The purpose of this study was the incorporation of realistic motion, such as cardiac motion, respiratory motion and flow, within MRI simulations in a high performance multi-GPU environment. ME...

متن کامل

Communication Avoiding Neumann Expansion Preconditioner for LOBPCG Method: Convergence Property of Exact Diagonalization Method for Hubbard Model

Application of Eisenstat-SSOR Preconditioner to Realistic Stress Analysis Problem by Parallel Cache-Cache Computing Kuniyoshi Abe1, Seiji Fujino2 1Faculty of Economics and Information, Gifu Shotoku University, Japan; 2Professor Emeritus, Kyushu University, Japan Communication avoiding Neumann expansion preconditioner for LOBPCG method: Convergence property of exact diagonalization method for Hu...

متن کامل

A Multi Objective Optimization Model for Redundancy Allocation Problems in Series-Parallel Systems with Repairable Components

The main goal in this paper is to propose an optimization model for determining the structure of a series-parallel system. Regarding the previous studies in series-parallel systems, the main contribution of this study is to expand the redundancy allocation parallel to systems that have repairable components. The considered optimization model has two objectives: maximizing the system mean time t...

متن کامل

Performance Engineering of the Kernel Polynomial Method on Large-Scale CPU-GPU Systems

The Kernel Polynomial Method (KPM) is a wellestablished scheme in quantum physics and quantum chemistry to determine the eigenvalue density and spectral properties of large sparse matrices. In this work we demonstrate the high optimization potential and feasibility of peta-scale heterogeneous CPU-GPU implementations of the KPM. At the node level we show that it is possible to decouple the spars...

متن کامل

Performance Prediction for Large Scale Parallel Systems

In both the design of parallel computer systems and the development of applications, it is very important to have good performance prediction tools. This paper describes a new approach -PetaSIM, which is designed for the rapid prototyping stage of machine or application design. Computers, networks and applications are described as objects in a Java IDL (Interface Definition Language) with speci...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Lecture Notes in Computer Science

سال: 2022

ISSN: ['1611-3349', '0302-9743']

DOI: https://doi.org/10.1007/978-3-031-10419-0_1